165 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English french italian
Availability:
Freely Available
License:
CC BY SA
Size:
11135674 entries Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:DBpedia Abstracts: A Large-Scale, Open, Multilingual NLP Training Corpus
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Martin Brümmer | University of Leipzig | DE | ||
| Author 2 | Milan Dojchinovski | InfAI, Germany and FIT CTU in Prague | CZ | ||
| Author 3 | Sebastian Hellmann | AKSW, University of Leipzig | DE | AKSW/KILT, Universität Leipzig | DE |
| Main Contact | Martin Brümmer | University of Leipzig | None |
Documentation:
Documentation with the corpus at resource URL
Written
Corpus,
Language Type:
Multilingual
Languages:
Danish Dutch Finnish Mandarin Chinese Standard Arabic
Availability:
Freely Available
License:
<Not Specified>
Size:
4.2 MByte Production Status:
Newly created-finished
Use:
Person Identification
-
Paper title:Creating and Curating a Cross-Language Person-Entity Linking Collection
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dawn Lawrie | Loyola College in Maryland | None |
| Author 2 | James Mayfield | Johns Hopkins University | None |
| Author 3 | Paul McNamee | Johns Hopkins University | None |
| Author 4 | Douglas Oard | University of Maryland | None |
| Main Contact | Dawn Lawrie | Loyola University Maryland | US |
Documentation:
Documentation in English with DownloadLanguage Type:
Multilingual
Languages:
Dutch English Spanish french italian
Availability:
Freely Available
License:
OpenSource
Size:
5 * 5000 Production Status:
Newly created-finished
Use:
Sentiment Analysis
-
Paper title:Generating Polarity Lexicons with WordNet propagation in 5 languages
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Isa Maks | VU University Amsterdam | NL | ||
| Author 2 | Ruben Izquierdo | VU University | NL | ||
| Author 3 | Francesca Frontini | PRAXILING - Université Paul-Valéry Montpellier 3 | FR | ILC CNR - Pisa Italy | FR |
| Author 4 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES | ||
| Author 5 | Piek Vossen | VU University Amsterdam | NL | ||
| Author 6 | Andoni Azpeitia | vicomtech | ES | ||
| Main Contact | Isa Maks | VU University Amsterdam | None |
Documentation:
yesLanguage Type:
Multilingual
Languages:
Dutch English Spanish italian
Availability:
Freely Available
License:
<Not Specified>
Size:
200k, ongoing work words Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
-
Paper title:SenTube: A Corpus for Sentiment Analysis on YouTube Social Media
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Olga Uryupina | University of Trento | IT | ||
| Author 2 | Barbara Plank | University of Copenhagen | DK | University of Groningen | NL |
| Author 3 | Aliaksei Severyn | University of Trento | CH | ||
| Author 4 | Agata Rotondi | University of Trento | IT | ||
| Author 5 | Alessandro Moschitti | Qatar Computing Research Institute | US | ||
| Main Contact | Olga Uryupina | University of Trento | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch Spanish french italian
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International (CC BY 4.0) License: https://creativecommons.org/licenses/by/4.0/.
Size:
2 GByte Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:MIsA: Multilingual "IsA" Extraction from Corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Stefano Faralli | University of Rome Unitelma Sapienza | IT |
| Author 2 | Els Lefever | LT3, Ghent University | BE |
| Author 3 | Simone Paolo Ponzetto | University of Mannheim | DE |
| Main Contact | Stefano Faralli | University of Rome Unitelma Sapienza | None |
Documentation:
http://web.informatik.uni-mannheim.de/misa/index.html
Multimodal/Multimedia
Educational materials and knowledge dissemination,
Language Type:
Multilingual
Languages:
Dutch English German Hungarian Polish
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Educational materials and knowledge dissemination
-
Paper title:Languagesindanger.eu - including multimedia language resources to disseminate knowledge and create educational material on less‑resourced languages
-
Paper track:Multimodality
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dagmar Jung | University of Cologne | DE |
| Author 2 | KATARZYNA KLESSA | The Institute of Linguistics, Adam Mickiewicz University in Poznan, Poland | PL |
| Author 3 | Zsuzsa Duray | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 4 | Beatrix Oszkó | Research institute for Linguistics, Hungarian Academy of Sciences | None |
| Author 5 | Mária Sipos | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 6 | Sándor Szeverényi | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 7 | Zsuzsa Várnai | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 8 | Trilsbeek Paul | Max Planck Institute for Psycholinguistics, Nijmegen | NL |
| Author 9 | Tamás Váradi | Research institute for Linguistics, Hungarian Academy of Sciences | None |
| Main Contact | KATARZYNA KLESSA | The Institute of Linguistics, Adam Mickiewicz University in Poznan, Poland | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English German french italian
Availability:
From Owner
License:
CC BY-NC
Size:
35M sentences Production Status:
Newly created-in progress
Use:
Lexicon Creation/Annotation
-
Paper title:Corpora of Typical Sentences
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Lydia Müller | University of Leipzig | DE |
| Author 2 | Uwe Quasthoff | University Leipzig | DE |
| Author 3 | Maciej Sumalvico | University of Leipzig | DE |
| Main Contact | Maciej Sumalvico | University of Leipzig | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Dutch English Spanish french
Availability:
Freely Available
License:
<Not Specified>
Size:
still under collection, over 600 hours OtherProduction Status:
Newly created-in progress
Use:
Language and accent identification, Speaker identification, L2 accent acquisition and development, L1 attrition, intelligibility
-
Paper title:Semi-automatic annotation of the UCU accents speech corpus
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rosemary Orr | University College Utrecht | NL |
| Author 2 | Marijn Huijbregts | Radboud University Nijmegen | NL |
| Author 3 | Roeland van Beek | University College Utrecht | NL |
| Author 4 | Lisa Teunissen | University College Utrecht | NL |
| Author 5 | Kate Backhouse | University College Utrecht | NL |
| Author 6 | David van Leeuwen | Radboud University Nijmegen | NL |
| Main Contact | David van Leeuwen | Radboud University Nijmegen | None |
Documentation:
Rosemary Orr, Hugo Quené, Roeland van Beek, Thari Diefenbach, David A. van Leeuwen, Marijn Huijbregts: An International English Speech Corpus for Longitudinal Study of Accent Development. INTERSPEECH 2011: 1889-1892
Written
<Not Specified>,
Language Type:
Multilingual
Languages:
Dutch English Portuguese Spanish
Availability:
Freely Available
License:
Open Source
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Language Identification
-
Paper title:VarClass: An Open-source Language Identification Tool for Language Varieties
-
Paper track:Written
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marcos Zampieri | University of Cologne | GB |
| Author 2 | Binyam Gebre | MPI for Psycholinguistics | NL |
| Main Contact | Marcos Zampieri | University of Wolverhampton | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Dutch
Availability:
Freely Available
License:
Creative Commons
Size:
61 KByte Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
-
Paper title:Using Tweets for Assigning Sentiments to Regions
-
Paper track:Short papers (4 pages)
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Erik Tjong Kim Sang | Meertens Institute | NL |
| Main Contact | Erik Tjong Kim Sang | Netherlands eScience Center | None |
Documentation:
<Not Specified>




